Icon

Data_​Cleaning[3290]

Column Filter

The node offers 3 different filtering modes: manually, by name, and by type.
- manually you decide which columns to keep and which to leave out, through Add and Remove buttons.
- by name you decide which columns to keep based on their name through wildcards and Reg Ex.
- by type you decide the columns to keep based on their type, like all Strings or all Integers.

Row Filtering Range CheckingNumber matching criterion Rule Based Row Filter Column Filtering Filter by column name withRegEx and wildcard Filter manually all columnsyou want to keep and all youwant to exclude Filter by column type Matching Missing Values Include &Exclude Pattern matchingString matching criterion Matching row number Include & Exclude Any matching criterion in a collectiontype column Simple Include With wild card & Exclude With wild card & Include Matching RowID value with wldcard &RegExInclude & Exclude Number in range & Include Number in range & Exclude Table Review Create File Table Review Create File String Manipulation Operation 3: Generate newcolumn from values in anexisting column Operation 1: String valuesto upper case letters Operation 2: Round doublevalues Filter by null Table Review Create File Nominal Value and Join You may also chose to exclude TRUEmatches rather than inverting the logicof each rule. A more complex expression. Include rows that match the regularexpression (StateName has a string) AND whose population isless than 5,000,000. Notice that the rule is written on only one line. Include rows that match one of twoindependent expressions. Include rows that match a single, simple expression. AnyStateName value that matches the wildcard "North*" is assigned aTRUE value and passed through the node. Table Review Creating File Table Review Creating File Input some dataDatasetexcludespecific columnexclude by typeall numerical columnsexclude all columns starting with specific letteronly selected column inspecific wordonly sales withquantity < 2& quantity > 5only sales withno missing quantityonly top 10 rowsonly last N-10 rowsonly rowswith RowID starting with Row1* only rowswith RowID not startingwith Row1*selected column in countries ending withspecific letterselected column in countries starting with specific letteronly sales withquantity >=2only sales withquantity >=2 &<=5starting with specific letterand case sensitiveonly selected column incountries notstarting with specific letteronly sales withmissing value in quantityonly sales withquantity = 2all values in one columnas collectionfiltering specific columnDatasetReviewreviewCreatingcreateselected columnto upper caseRound selected columnto two decimalsnew column based on filteringRead airline dataFilter specific row valueFilter specific row valueNode 49ReviewingCreatingRound up age to the nearest 10Replace "-" with "" in native country valuesJoin backDatasetMulti lineexpressionMore complexlogical expressionSimpleexpressionSame as above, but exclude matchesReviewingCreatingCreatingReviewing Table Creator File Reader(deprecated) Column Filter Column Filter Column Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Create CollectionColumn Row Filter File Reader Table View Table View CSV Writer CSV Writer String Manipulation Math Formula(Multi Column) Rule Engine File Reader(deprecated) Nominal ValueRow Filter Nominal ValueRow Filter Missing ValueColumn Filter Table View CSV Writer Math Formula String Manipulation Concatenate File Reader(deprecated) Rule-basedRow Filter Rule-basedRow Filter Rule-basedRow Filter Rule-basedRow Filter Table View CSV Writer CSV Writer Table View Row Filtering Range CheckingNumber matching criterion Rule Based Row Filter Column Filtering Filter by column name withRegEx and wildcard Filter manually all columnsyou want to keep and all youwant to exclude Filter by column type Matching Missing Values Include &Exclude Pattern matchingString matching criterion Matching row number Include & Exclude Any matching criterion in a collectiontype column Simple Include With wild card & Exclude With wild card & Include Matching RowID value with wldcard &RegExInclude & Exclude Number in range & Include Number in range & Exclude Table Review Create File Table Review Create File String Manipulation Operation 3: Generate newcolumn from values in anexisting column Operation 1: String valuesto upper case letters Operation 2: Round doublevalues Filter by null Table Review Create File Nominal Value and Join You may also chose to exclude TRUEmatches rather than inverting the logicof each rule. A more complex expression. Include rows that match the regularexpression (StateName has a string) AND whose population isless than 5,000,000. Notice that the rule is written on only one line. Include rows that match one of twoindependent expressions. Include rows that match a single, simple expression. AnyStateName value that matches the wildcard "North*" is assigned aTRUE value and passed through the node. Table Review Creating File Table Review Creating File Input some dataDatasetexcludespecific columnexclude by typeall numerical columnsexclude all columns starting with specific letteronly selected column inspecific wordonly sales withquantity < 2& quantity > 5only sales withno missing quantityonly top 10 rowsonly last N-10 rowsonly rowswith RowID starting with Row1* only rowswith RowID not startingwith Row1*selected column in countries ending withspecific letterselected column in countries starting with specific letteronly sales withquantity >=2only sales withquantity >=2 &<=5starting with specific letterand case sensitiveonly selected column incountries notstarting with specific letteronly sales withmissing value in quantityonly sales withquantity = 2all values in one columnas collectionfiltering specific columnDatasetReviewreviewCreatingcreateselected columnto upper caseRound selected columnto two decimalsnew column based on filteringRead airline dataFilter specific row valueFilter specific row valueNode 49ReviewingCreatingRound up age to the nearest 10Replace "-" with "" in native country valuesJoin backDatasetMulti lineexpressionMore complexlogical expressionSimpleexpressionSame as above, but exclude matchesReviewingCreatingCreatingReviewing Table Creator File Reader(deprecated) Column Filter Column Filter Column Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Row Filter Create CollectionColumn Row Filter File Reader Table View Table View CSV Writer CSV Writer String Manipulation Math Formula(Multi Column) Rule Engine File Reader(deprecated) Nominal ValueRow Filter Nominal ValueRow Filter Missing ValueColumn Filter Table View CSV Writer Math Formula String Manipulation Concatenate File Reader(deprecated) Rule-basedRow Filter Rule-basedRow Filter Rule-basedRow Filter Rule-basedRow Filter Table View CSV Writer CSV Writer Table View

Nodes

Extensions

Links